Ontology-driven Information Retrieval in FF-Poirot

نویسندگان

  • Roberto Basili
  • Marco Cammisa
  • Maria Vittoria Marabello
  • Marco Pennacchiotti
  • Dario Saracino
  • Fabio Massimo Zanzotto
چکیده

This paper proposes a new approach for supporting domain information retrieval and information extraction on the web, using an original query expansion technique supported by an ad-hoc ontology focused on a specific domain of interest. The system has been built and tested in the framework of the FF-Poirot project, for supporting fine-grain retrieval from the Internet aiming at detecting financial fraudent sites. In a first stage, using a short list of keywords given by the user, the application mines the web retrieving relevant documents. These documents are then clustered into coherent groups focusing on specific subjects. The ontology model is devoted to represent the most important concepts of the domain of interest and to link them to the user need as expressed by the keywords. Once clusters of documents are made available after the first stage, the ontology can be used to extract from these clusters the most interesting documents (the most probable fraudolent sites in the framework of the FF-Poirot application ). Browsing the ontology and selecting specific concepts, the user starts a query expansion engine that refines the search, creating a new query based on terminological evidences tied in the ontology to the selected concepts. The paper describes the overall software architecture of the application as used in the project, focusing specifically on the query exapansion engine and the supporting ontological model adopted. Experimental evidences, as emerged in FF-Poirot, will be used to prove the feasibility and the advantages of the adopted technique.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparison of a Regulatory Ontology with Existing Legal Ontology Frameworks

In this paper we describe the nature of a regulatory ontology to be developed to support systems that tackle financial fraud. This work is part of the FF POIROT European IST project. We describe existing legal ontologies and examine then how these can be re-used to realize the ontology requirements identified for FF POIROT. We will discuss the proposed categories and their limitations for the o...

متن کامل

Public Transport Ontology for Passenger Information Retrieval

Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit user’s travel requirements. The integration of transit information from multiple agencies is a major challenge in implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...

متن کامل

An Architecture Framework for Ontology Development and Deployment

This paper identifies requirements for an ontology development platform to facilitate methodical ontology engineering and ontology application development. It introduces the DOGMA ontology framework, developed with insights from semantic modeling and methodology in database engineering, at STARLab, VUB. It has been adopted for ontology modeling and the development of ontology facilities in such...

متن کامل

The socio - cognitive theory in information retrieval (IR)

Abstract Background and Aim: The socio-cognitive theory introduced in information science by Horland and Alberchtsen. The socio-cognitive view turns the traditional cognitive program upside down. The socio-cognitive theory emphasizes on different cultural and social structures of users. Hence, the aim of the article is to explain the role of socio - cognitive theory in information retrieval (I...

متن کامل

An Architecture Framework of Ontology Development and Deployment

This paper identifies requirements for an ontology development platform to facilitate methodical ontology engineering and ontology application development. It introduces the DOGMA ontology framework, developed with insights from semantic modeling and methodology in database engineering, at STARLab, VUB. It has been adopted for ontology modeling and the development of ontology facilities in such...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005